Inventi Impact: Cloud Computing

Articles

Inventi:ecc/24/12

HYBRID CLOUD AND CLUSTER COMPUTING PARADIGMS FOR LIFE SCIENCE APPLICATIONS

31-Dec-2011 Research 2012 : January - March

Judy Qiu, Jaliya Ekanayake, Thilina Gunarathne, Jong Y Choi, Seung-Hee Bae, Hui Li, Bingjing Zhang, Tak-Lon Wu, Yang Ruan, Saliya Ekanayake, Adam Hughes, Geoffrey Fox

Background
Clouds and MapReduce have proven broadly useful for scientific computing, especially for parallel data-intensive applications. However, they have limited applicability in some areas, such as data mining, because MapReduce performs poorly on problems with the iterative structure found in the linear algebra that underlies much data analysis. Such problems can be run efficiently on clusters using MPI, which leads to a hybrid cloud and cluster environment. This motivates the design and implementation of Twister, an open source Iterative MapReduce system.

Results
Comparisons of Amazon, Azure, and traditional Linux and Windows environments on common applications have shown encouraging performance and usability in several important non-iterative cases. These are linked to MPI applications for the final stages of the data analysis. Further, we have released the open source Twister Iterative MapReduce and benchmarked it against basic MapReduce (Hadoop) and MPI in information retrieval and life sciences applications.

Conclusions
The hybrid cloud (MapReduce) and cluster (MPI) approach offers an attractive production environment, while Twister promises a uniform programming environment for many life sciences applications.

Methods
We used the commercial clouds Amazon and Azure and the NSF resource FutureGrid to perform detailed comparisons and evaluations of different approaches to data-intensive computing. Several applications were developed in MPI, MapReduce, and Twister in these different environments.
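The iterative structure described in the Background can be illustrated with a small sketch. The Python below is not Twister's API, and it is not taken from the article. It is a single-process toy that assumes one-dimensional k-means clustering as a familiar iterative algorithm, and all function names (`map_assign`, `reduce_mean`, `iterative_mapreduce`) are hypothetical. It shows the pattern an iterative MapReduce runtime has to support: the same map and reduce steps are repeated over unchanged input data, and only a small piece of state (here, the centroids) changes between rounds.

```python
from collections import defaultdict


def map_assign(point, centroids):
    """Map step: emit (index of the nearest centroid, point)."""
    nearest = min(range(len(centroids)), key=lambda i: abs(point - centroids[i]))
    return nearest, point


def reduce_mean(values):
    """Reduce step: average all points that share a key."""
    return sum(values) / len(values)


def iterative_mapreduce(points, centroids, max_iters=20, tol=1e-9):
    """Repeat map and reduce until the centroids stop moving.

    `points` is the static data: it is loaded once and reused in every
    iteration. Only `centroids` (the small, changing state) is updated.
    Returns the final centroids and the number of iterations executed.
    """
    centroids = list(centroids)
    for iteration in range(1, max_iters + 1):
        # Map phase: group points by their nearest centroid.
        groups = defaultdict(list)
        for p in points:
            key, value = map_assign(p, centroids)
            groups[key].append(value)

        # Reduce phase: recompute each centroid; keep the old value
        # if no point was assigned to it in this round.
        new_centroids = [
            reduce_mean(groups[i]) if groups[i] else centroids[i]
            for i in range(len(centroids))
        ]

        # Convergence check: stop when no centroid moved more than tol.
        shift = max(abs(a - b) for a, b in zip(new_centroids, centroids))
        centroids = new_centroids
        if shift < tol:
            return centroids, iteration
    return centroids, max_iters
```

In a basic MapReduce system, each pass through the loop would typically be submitted as a separate job that rereads the input, which is one reason iterative algorithms are costly there. Keeping the static data in place across rounds, as the loop above does trivially in memory, is the kind of reuse an iterative runtime aims for.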

How to Cite this Article
CC Compliant Citation: Qiu et al.: Hybrid cloud and cluster computing paradigms for life science applications. BMC Bioinformatics 2010 11(Suppl 12):S3. doi:10.1186/1471-2105-11-S12-S3
